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Abstract 


We have hiked many miles alongside several professors as we traversed our statistical path - a regime switching 
trail which changed direction following a class on the foundations of our discipline. As we play the game of research 
in that limbo between student and academic, one thing among Prof. Bernardi's teachings has never been more clear: 
to draw a route in the research map you not only need to know your destination, but you must also understand where 
you are and how you arrived there. 


Where are we? 

Students who are set in their choice to study a spe¬ 
cific discipline since the beginning are like outliers. A 
few dots far away, determined and with clear ideas, 
which are likely to be generated from a latent pro¬ 
cess of decisions and education very different from 
that random mechanism matching the ambitions of 
the majority of students with their university route. 

We are no exception. Our decision to study Statis¬ 
tics resulted from the minimization of different loss 
functions, grossly calibrated and solved with few 
data. Since then it has been a shared journey, made 
of several hikes in which we have filled that bag of 
skills and knowledge to carry with us on the journey 
towards the future. A regime switching path, which 
converged towards the PhD: a common destination 
of one journey and a starting point of a new one. 

Among all these structural breaks in our statisti¬ 
cal journey, the one that occurred during a class on 
the foundations of Statistics was certainly one of the 
most decisive. An educational revolution leading us 
toward an inevitable paradigm shift, or better said, 
towards an ontological and epistemological enlarge¬ 
ment of the primitive methodological paradigm we 
had prior to that class. We were already capable of 
playing with several, sometimes even sophisticated, 
statistical techniques. But had we asked ourselves 
if these phenomena existed (ontology)? And if they 
existed, were we sure it was possible to learn them 
(epistemology)? If yes, how (methodology)? 


The answers to these questions are all "no", we 
had no idea about the response. Before studying 
the foundations of our discipline we were simply 
playing carefree in the Sciences' backyards, without 
wondering whether those gardens really existed or if 
statisticians were allowed to enter. We were like that 
conceited person who studies nature through statisti¬ 
cal techniques, before making sure, as Galileo Galilei 
claims, that it is truly written in a mathematical lan¬ 
guage whose fonts are circles, squares and geome¬ 
tries. 

Where were we? Was Statistics "dull, disreputable, 
prosaic and misleading [...], third rate discipline [...] a 
jackal picking over the bones and carcasses of the game that 
the big cats, the biologists, the physicists and the chemists, 
have brought down"? Or rather "a wonderful discipline, 
including mathematics and philosophy, analysis and em¬ 
piricism [...]. Which demands clear thinking, good judge¬ 
ment and flair [...], telling us how to turn data into de¬ 
cisions [...]. A Science dealing with the very essence of 
the universe: chance and contingency"? Among these 
two definitions given by Stephen Senn in Dicing with 
Death (2003), we had hope in the second one to 
prove to be true. That foundations class confirmed 
our expectations, while adding the further awareness 
that "data are not statistics as well as the marble is not 
the sculpture”. A sentence from Bernardo Colombo 
which Lorenzo Bernardi commented in his contri¬ 
bution Statistics and Mass Media (2001), highlighting 
how "both need to be modeled to gain intrinsic value and 
both somehow have the breath, originality and inspiration 


1 



The Compass for Statistical Researchers • January 2015 


of their creator ". 

This is the story of our journey across the foun¬ 
dations of Statistics. From the epistemological con¬ 
sciousness, towards the methodological reasoning, 
until reaching the understanding of the techniques. 
Later, we will go beyond the path, placing Statistics 
in the map of Sciences, while looking at its final goal: 
the communication. 

Back to the methodological future 

Prior to learning the foundations of Statistics, for us 
testing a statistical hypothesis was a simple proce¬ 
dure: simply subtracting a value from the mean, di¬ 
viding it by a measure of variability, and checking 
whether the result was within a certain rejection re¬ 
gion. We certainly had no idea that what we were 
doing was instead placed in a much wider method¬ 
ological framework. Our immediate declaration of war 
was in fact only the final moment of a richer research 
path which required, among others, a careful com¬ 
parison with existing contributions, the formulation 
of specific hypotheses, the identification of the aggre¬ 
gate to study, and the choice of adequate analysis 
instruments. But why did we need this researcher's 
shopping list? Our calculations would still have re¬ 
mained the same. 

Continuing our journey back to the future, the an¬ 
swer we gave to the previous question was "yes", the 
results would have been the same, but without the 
warranty of the method they would have had zero 
weight in the scientific progress. We started looking 
at the method as the insurance of knowledge. An "ars 
bene disponendi seriem plurimarum cogitationum" [art of 
ordering accurately multiple thoughts] framed within 
specific historical coordinates, and with clear epis¬ 
temological roots marking that invisible and some¬ 
times very thin border between science and non¬ 
science. Where were we then? 

In that ancient map representing the history of 
Science our coordinates were inductive inference, ex¬ 
perimental method, intersubjectivity of Science and cross¬ 
fertilization. A paradigmatic north-south-east-west 
where Statistics, intended as the Science aimed at 
balancing the empirical evidence with the hypothe¬ 
ses and conjectures of reason, was predestined to be¬ 
come the queen. We realized that our little calcula¬ 
tions were sons of a new scientific method arising 


during the 16th and 17th centuries from a renewed 
confidence in the human abilities. A period of Renais¬ 
sance that gradually broke the barrier between natu¬ 
ral and artificial in favor of a re-evaluation of the tech¬ 
nical and practical tools, and in which artists such 
as Brunelleschi, Mantegna and Leonardo da Vinci 
were also experts in urban planning, architecture 
and ballistics. A new worldview that abandoned the 
Aristotelic distinction between knowledge directed to 
practice and knowledge devoted to the contempla¬ 
tion of truth, favoring a much deeper experimental 
dialogue with the nature. 

Our little problems were thus the result of a new 
relationship with phenomena, no more characterized 
by a passive observation based on laws "in libris" [in 
books], but an experimental one. The statistical tech¬ 
niques we used to process our data became instead 
like the telescope Galileo pointed skyward in 1609. 
Reliable research tools in prolonging the senses to 
favor a deeper understanding of nature. We discov¬ 
ered, finally, that our hypothesis testing procedure 
was actually part of a more ambitious attempt of 
"secure naturam" [slicing the nature]. We felt like lit¬ 
tle Galileos involved in a research process that re¬ 
calls his study on falling bodies in 1608, where the 
comparison between each measured distance with 
what "should have been" (on the basis of previous the¬ 
ories) represented the close relationship between the¬ 
ory and meaningful experience. We were freeing our¬ 
selves from the concept of "auctoritas" [authority] to 
venture into a growing education towards the critical 
meaning and the reasonable interrogation of nature 
within the guidelines of the inductive inference. 

But, again, why was the method so important? 
The answer is that it can't be any other way, ac¬ 
cording to a notion of Science which favored cross¬ 
fertilization and introduced the intersubjectivity as a 
form of objectivity. Citing Leonardo da Vinci "those 
who love practice without theory are like the sailor who 
boards ship without a rudder and compass and never 
knows where he may cast." The method was the rud¬ 
der and the compass guaranteeing internal consis¬ 
tency, transparency and reproducibility. Essential el¬ 
ements to allow comparison and monitoring of devel¬ 
opments of knowledge in a slow process of progres¬ 
sive accumulations made by corrections and substi¬ 
tutions. An iceberg-shaped Science whose great dis¬ 
coveries and revolutions arise from a much broader 
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layer of minor theories and findings, sometimes fal¬ 
sified and not visible to human eyes. 

Hot data & doctrines of uncertainty 

Data and probability are the statistician's daily bread. 
Citing Edward Deming, it was clear to us from the 
beginning that " without data you are just another per¬ 
son with an opinion." However, we didn't expect 
that these two basic ingredients in the kitchen of the 
statistician were indeed so elaborate and delicate. 

For us, data were initially cold numbers that some¬ 
one gave us and which in turn we used to feed our 
statistical-probabilistic instruments to answer certain 
questions. During that foundations class, for the 
first time we started instead seeing them as hot in¬ 
formation, and their collection process as a further 
expression of the artistic value of our discipline. Con¬ 
sider the Social Sciences, where the statistical sur¬ 
vey became a slow artistic and painstaking process 
in which the statistician played cat-and-mouse with 
"chunks " of information so as to obtain the maximum 
quality from the raw material harvested. It was nec¬ 
essary to present the purpose of the survey and then 
consider general questions before moving slowly to¬ 
wards increasingly specific aspects introducing the 
interviewee to the " threatening " questions. A first date 
with data, where the statistician had to go through a 
series of ordered lovely actions, hoping that the so¬ 
cial phenomenon of interest showed itself in all sin¬ 
cerity. 

What about probability? We were able to play 
with several probabilistic tools. But from which re¬ 
mote planet was probability coming from? Or maybe 
we had learned it from ancient hieroglyphics? Was 
the concept universal? The first signboard along the 
fascinating trail that traces the historical and philo¬ 
sophical foundations of probability indicated it was 
coming from the planet New Scientific Method, which 
we had already explored. Citing Costantini's Histori¬ 
cal and Philosophical Foundations of Statistics and Proba¬ 
bility (2004), "the probability was the notion with which, 
during the breakthrough that led to the modern era, we 
tried to discover the laws of phenomena characterized by 
uncertain behavior ." 

Our Martians were therefore Laplace, von Mises, 
Jeffreys and De Finetti. The hieroglyphs were in¬ 
stead written on books like Laplace's Theorie analy- 


tique des probabilities (1812), Von Mises' Wahrschein- 
lichkeit, Statistik und Wahrheit (1928), De Finetti's Sul 
significato soggettivo della probability (1931) and Jef¬ 
frey's Theory of Probability (1939). For us probability 
was no longer just a number between 0 and 1, which 
expressed a measure of certainty of an event, but took 
the shape of a history about the beliefs in the uncertainty. 
A branching process starting from the fact that nature 
is not is deterministic, and later branching out in dif¬ 
ferent doctrines to define its uncertainty. 

The way we initially viewed probability as the 
ratio of the favorable cases to the total number of 
cases was only one possible definition, the classical 
one formalized by Laplace. What about the others? 
We had unconsciously been using one of these defi¬ 
nitions, the Von Mises' frequentist one, since our early 
classes in inference when we considered probabil¬ 
ity in terms of the limit of a relative frequency with 
which an event occurs in a very large (infinite) popu¬ 
lation. Having a frequentist background, we saw the 
other two, the De Finetti's subjectivist and Jeffrey's 
logicist definitions, as perhaps more surprising but 
no less interesting. The first placed the subject in 
the foreground, defining the probability as a degree 
of confidence that an individual assigns to the occur¬ 
rence of an event according to his prior knowledge. 
The second instead defined probability as a logical 
relationship that exists between something known 
and something that is not, highlighting the objectiv¬ 
ity (outside the subject) of the relationships. 

The question at this point was: why should we 
proceed in our journey with this historical baggage on 
probability? Simply because each of the paradigms 
of statistical inference arises from the aforemen¬ 
tioned definitions of probability. We realized how 
the Fisherian approach inherited the frequentist con¬ 
cept of probability in exploiting information from the 
observed data (seen as realizations from a true un¬ 
known generative process) to appropriately estimate 
the generative mechanism and evaluate how sensi¬ 
tive this reconstruction was to the fact that observed 
data were only one sample of many possible. The 
frequency-decision theory adopted the same defini¬ 
tion of probability, but changed the main focus of 
statistical inference to provide rules for action in sit¬ 
uations of uncertainty. The subjective Bayesian ap¬ 
proach was instead epistemologically and method¬ 
ologically different in explicitly studying how sub- 
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jects' initial confidence (prior knowledge) about the 
uncertain generative mechanism changed in the light 
of data. It was clear therefore how the subjectivist def¬ 
inition of probability was more suitable within this 
paradigm. The methodology remained mostly simi¬ 
lar in the objectivist Bayesian approach, with an epis¬ 
temological shift from prior knowledge to prior igno¬ 
rance, progressively abandoning the subjective idea 
to reach the objective one stressed by the logicist def¬ 
inition. 

Far from the scope of providing a detailed and 
comparative overview of these paradigms (see Bar¬ 
nett's Comparative Statistical Inference (1999) for an 
extensive discussion), the main lesson we learned 
was that the choice of each paradigm of statistical in¬ 
ference carries specific definitions of probability we 
should be aware of in order to avoid confusion in 
choosing our techniques and in drawing our conclu¬ 
sions. 

An ancillary science 

Now that we had found our coordinates in the map 
of Science and read the instructions of some toys we 
enjoyed playing with each day, we could fully un¬ 
derstand a definition of Statistics often repeated by 
Lorenzo Bernardi. An ancillary science. A discipline 
that is fundamental to all other Sciences. The main 
actor in that slow process of donning with new cer¬ 
tainties and doffing of falsified thesis that we had 
come to know by the name of scientific progress. Fi¬ 
nally, not only did we appreciate Tukey's claim “the 
best thing about being a statistician is that you get to 
play in everyone's backyard," but we could also give 
a name to each of these backyards, understanding at 
the same time when we could enter and in which 
game we could play. We even discovered that these 
gardens had much more to offer than we initially 
thought. 

There was the garden of Medicine, made for ex¬ 
ample of growth curves, odds, and survival mod¬ 
els. In the Sports and Betting backyard we found 
sophisticated methods to model sports results; see 
also contributions in the third issue of volume 27 of 
Chance. The garden of Finance and Economy was 
covered with refined micro- and macro-econometric 
models and roller coasters of time series. At the 
end of the race the watchword was forecast. Cer¬ 


tainly Sharpe, Miller and Markowitz in 1990 and En¬ 
gle and Granger in 2003 must have enjoyed more 
than anyone else winning the Nobel Prize for eco¬ 
nomics. Prediction, especially of extreme events, was 
a key also in the backyard of Natural Sciences to pro¬ 
vide tools for planning in agriculture and to antici¬ 
pate natural disasters. In the garden of Industry we 
met, among others, experimental design and control 
charts for evaluating the conformity of the products 
with their required characteristics to avoid unpleas¬ 
ant surprises for customers. The latter became kings 
in the garden of Marketing, flooded with tons of data 
from the web which only data mining techniques 
could have been able to transform into useful infor¬ 
mation. Actors within certain social phenomena re¬ 
placed customers in the park of Social Sciences, stud¬ 
ded with social indicators, networks and many other 
sociometric instruments. 

This was only one wing of a huge Neverland for 
statisticians, embracing all fields of knowledge such 
as physics, astronomy, chemistry, biology, psychol¬ 
ogy, archeology and many others. A Disney World 
that is constantly renewed to keep up with the times, 
new technologies and issues; where Statistics be¬ 
comes Senn's (2003) " wonderful discipline that includes 
mathematics and philosophy, analysis and empiricism," 
and the new attractions are Neuroscience, Bioinfor¬ 
matics, Computer Science, Web Marketing, Senti¬ 
mental Analysis and Applied Criminology. 

An amusement park we wanted to celebrate in 
the Statistical Calendar (cal.stat.unipd.it/index.html) 
as well as in the video My Statistician Friend 
(www. youtube.com/watch?v=yU2qQywUnnU). 

Two dissemination projects perhaps naive, but pas¬ 
sionate, which we would have never considered with¬ 
out having attended that foundations class. 

The sexy job? 

“I keep saying the sexy job in the next ten years will be 
statisticians. People think I’m joking." At this point 
in our journey we have no doubts that the chief 
economist of Google Hal Varian was not joking. How 
can a discipline that combines art, science and tech¬ 
nology not be sexy? However, the more we looked 
at the collective idea of Statistics, the more it seemed 
to be commonly perceived as Stephen Senn's (2003) 
"nasty old lady, you don't know her, but she loathes you 
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already," or "like Australia, everyone knows where it is 
but no one wants to go there. Except that people do want 
to go to Australia ." 

Every time we found ourselves discussing Statis¬ 
tics among students from other fields, we noticed 
how after a few minutes they started surfing far 
away in a slow decaying process converging every¬ 
where except in a neighborhood centered around us 
and with radius "from here we do not listen to what 
you are speaking about." The few brave hearts who 
rarely remained started asking a series of questions 
randomly sampled from that set of conceptualiza¬ 
tions and stereotypes, such as the famous Trilussa's 
chicken or newer versions like the one claiming that 
if you have your head in the oven and your feet in 
the freezer then your averaged body temperature is 
within the limits of healthy living. 

In that complex classification tree grouping stu¬ 
dents according to their field of study, we found our¬ 
selves in an isolated branch representing the semi- 
known disciplines. This fact sounded extremely dis¬ 
sonant. If the goal of Statistics is to transform data 
into information, why wasn't this information being 
acknowledged? We wanted to understand why, after 
playing in the Sciences backyards, they didn't come 
up and see our etchings. The answer we gave to this 
question was that it wasn't enough to have etchings 
in our room, but it was also necessary to know how 
to correctly present them and how to properly invite 
Sciences. 

In that slow process of reconciliation with the 
other Sciences we understood the need to avoid 
messy floods of data and instead provide transparent 
information, favoring the opportunity to recognize 
our conceptual choices and report the general frame¬ 
work our analyses and interpretations came from. In 
parallel, it was important to create an airlock to bal¬ 
ance the need to inform, sometimes with sophisti¬ 
cated methods of Statistics, with the aloof and sus¬ 
picious attitude of some interlocutors who had to 
transform information into decisions. This armistice 
would have been possible only when the first would 
abandon its arrogance and, quoting George Box, the 
"bad habit of falling in love with its models," while the 
latter would have changed that attitude ranging be¬ 
tween quantitative mythology and censorship. 

How could we avoid the arrogance ? Simply ac¬ 
cepting the idea that many fields of statistical re¬ 


search are relative. Citing Bernardi (2001) "the spirit 
of statistical research can be explained by this story: to the 
friend who asks how's the wife, the economic statistician 
will reply: compared to when? while the social statistician 
will ask: with respect to whom?" Part of our arrogance 
came from the need to always provide absolute an¬ 
swers resulting from naive conclusions, rather than 
proper exploitation of the analyses. 

Our checklist ended with an apology of simplicity: 
we had to throw out the high-tech weapons and first 
learn to hit little birds with a slingshot instead of 
a cannon. Developing sophisticated statistical mod¬ 
els was certainly much more stimulating than the 
slow process of verification of hypotheses and re¬ 
sults somehow already present in the collective ideas. 
However, as Bernardi (2001) claims, "while the first is 
based on a scientific approach which sometimes proves pre¬ 
carious and naive, the second is always a solid starting 
point for new curiosities and fruitful intuitions." 

An exercise of "restating the obvious so as such re¬ 
mains," to train ourselves in the effective commu¬ 
nication of finest methods, balancing sophistication 
and simplicity. Two not necessarily rival aspects, but 
difficult to reconcile for fear of trivializing the con¬ 
tribution of Statistics. Our masterpieces should not, 
therefore, have been merely an expression of Pop 
artists, but they also had to reflect the careful work 
of copyists monks engaged in that fundamental work 
of "translation and simplification of teaching, easily acces¬ 
sible for the reader." 

Only by accepting this responsibility will we 
make peace with the other Sciences, allowing them 
to appreciate our etchings. Conversely if we do not 
take proper care in communication, then we will re¬ 
main that conceited guy admiring himself in the mir¬ 
ror without realizing he is indeed alone in a corner 
while all the Sciences are enjoying the dance floor of 
scientific progress. 
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